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Abstract 

Advances in Micro-Electro-Mechanical Systems (MEMS) have revolu- 
tionized the digital age to a point where animate and inanimate objects can 
be used as a communication channel. In addition, the ubiquity of mobile 
phones with increasing capabilities and ample resources means people are 
now effectively mobile sensors that can be used to sense the environment as 
well as data carriers. These objects, along with their devices, form a new 
kind of networks that are characterized by frequent disconnections, resource 
constraints and unpredictable or stochastic mobility patterns. A key under- 
pinning in these networks is routing or data dissemination protocols that are 
designed specifically to handle the aforementioned characteristics. There- 
fore, there is a need to review state-of-the-art routing protocols, categorize 
them, and compare and contrast their approaches in terms of delivery rate, 
resource consumption and end-to-end delay. To this end, this paper reviews 
63 unicast, multicast and coding-based routing protocols that are designed 
specifically to run in delay tolerant or challenged networks. We provide 
an extensive qualitative comparison of all protocols, highlight their experi- 
mental setup and outline their deficiencies in terms of design and research 
methodology. Apart from that, we review research that aims to exploit stud- 
ies on social networks and epidemiology in order to improve routing proto- 
col performance. Lastly, we provide a list of future research directions. 

Keywords: Challenged networks, routing, disruption tolerant networks, epi- 
demic, data dissemination 
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1 Introduction 



Delay or Disruption Tolerant Networks (DTNs) are characterized by long delays 
and intermittent connectivity. Moreover, they may have power constraints, low 
and asymmetric bandwidth, and high bit-error rates HIED 010. To illustrate 
some of these characteristics, consider forming a network using the vehicles of 
Figured] All vehicles, e.g., buses and cars, are equipped with a radio transceiver, 
which allows them to communicate with each other, and also to access points, 
which have connectivity to the Internet and are planted strategically in differ- 
ent parts of the city. In this network, all vehicles will help each other forward 
messages to each other and also to access points. Given the limited transmis- 
sion range, the intermittent connectivity of vehicles and location of access points, 
messages will experience significant delays. Another key characteristic is the so 
called store-carry-forward model used to propagate messages. That is, a vehicle 
may have to store and carry a message for some distance before encountering and 
passing the message onto another vehicle or access point. In this regards, a key 
mobility pattern that can be exploited by routing protocols is the predictable mo- 
bility pattern and schedule of buses. In addition, any routing protocols will have 
to consider the link capacity and duration of each connection, which is governed 
by channel condition and vehicle speed. 

We can also form a DTN using people. This can be easily realized given the 
ubiquity of smart phones equipped with a plethora of sensors and transceivers. 
Hence, they can be used to monitor traffic, crowd, air pollution and spread of 
diseases to name a few. Unlike vehicle based DTNs, smart devices have resource 
constraints; e.g., limited battery life. Moreover, people will have varying contact 
duration and frequency. That is, their movement pattern will be less predictable 
than vehicles. Consider User- A in Figure Q] who wants to send a file to one or 
more students attending the University of Wollongong (UoW). Also shown is a 
possible transmission path, which depends on encounters with other users of the 
DTN. Inevitably, the resulting topology or path taken will be random in nature 
and changes with space and time. More specifically, it is difficult to predict as 
it depends on the mobility patterns of people. Interestingly, the authors of 11291 
showed that nodes/students who are attending the University of Cambridge are 
not always connected, and hence they experience large delays between meetings. 
For example, nodes/students may connect during a class, and disconnect when the 
class finishes. The next class may be eight hours away. Secondly, the movement 
of nodes and encounter duration are random. For instance, two good friends may 
remain in contact for a lot longer as compared to other students. Thirdly, nodes 
exhibit a mobility pattern that coincides with meeting times, e.g., lectures, and 
path to a given classroom. 

As we can see from Table [Q researchers have proposed various DTNs and 
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Figure 1 : An example comprising of DTNs formed by vehicles and people. 

applications. For example, in ZebraNet 0, zebras have custom sensors that track 
their movement patterns and locations. A device carried by a person or a mobile 
base station is then used to collect the said tracking data. As mobile base sta- 
tions have limited communication range, zebras exchange information with other 
zebras until they encounter a mobile base station. Given that zebras and a mo- 
bile station rendezvous randomly, i.e., they may not meet each other for days or 
weeks, tracking data incurs significant delays before scientists are able to col- 
lect them. Moreover, as the movements of zebras are unpredictable, links are 
established intermittently and hence there are no end-to-end paths from zebras to 
mobile base stations. In addition, ZebraNet also has storage, bandwidth and en- 
ergy constraints. Specifically, the sensors on each zebra have a lifetime of only 
one month, are equipped with a 1MB flash Random Access Memory (RAM) and 
have a data transmission rate ranging from 2.4 to 19.2 kilobits per seconds. An- 
other wildlife example is SWIM Q, where a sensor network is used to monitor 
whales. SWIM combines two kinds of nodes: (i) sensors, and (ii) infostations. 
The sensors are attached to whales, and the infostations are used to collect data 
from passing whales. In [7], sensors networks are used to monitor water pollution 
and noise level in urban areas. 

There are also a number of applications that involve special nodes. For ex- 
ample, DakNet [[81 uses vehicles or data ferries to provide low-cost data delivery 
between rural villages. In each village, a kiosk is used by villagers to store mes- 
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sages and send data to visiting data ferries, which then uploads the data they have 
collected onto the Internet. As data is carried by data ferries, it experiences a 
much higher delay than conventional networks. To clarify, the delays incurred by 
messages are affected by several data ferry parameters: (i) routes taken to reach 
a village, (ii) schedules, (iii) speed, (iv) number of ferries, and (v) distance be- 
tween kiosks. In a different work, the Pollen network uses humans as data 
ferries, where mobile devices carried by humans exchange information with each 
other, and/or with a fixed network. Other projects that use data ferries include 
KioskNet OH, MotoPost 0J], Wizzy Net (El, Widernet flU, Digital Study Hall 
(HI, DigitalGreen 031, Body Sensors O, and Digital Polyclinic OH. 

The aforementioned characteristics require novel routing protocols that ad- 
dress the following challenges: 

• Stochastic and dynamic topologies, as nodes are mobile and can engage in 
various mobility patterns lfT8ll . For example, nodes may be vehicles on a 
freeway or wild animals roaming in a national park. These unstable topolo- 
gies lead to unpredictable, uncontrolled movements, large delays and arbi- 
trary disconnections. 

• Limited topological information, which compounds the difficulty of finding 
routing metrics that accurately reflect network conditions. A path from a 
source to a destination can be static or dynamic. However, as pointed out 
by ffTTl . without topology information, static routes are not suitable for dy- 
namic topologies. Another challenge is the lack of up to date topological 
information which can be used to calculate the best path to a given destina- 
tion. Therefore, as will be evident later, dynamic routing protocols tend to 
use local metrics, such as the number of times nodes have encountered each 
other. 

• Variable and uncertain connection duration. Routing protocols need to de- 
cide whether to transmit all or a subset of bundles when nodes encounter 
each other. For example, as zebras in ZebraNet O meet for a limited time 
period, the routing protocol has to decide which data to forward in order to 
maximize delivery probability. 

• Limited resources, which requires protocol designs to be efficient. In other 
words, nodes must utilize their limited hardware resources such as CPU, 
memory and battery efficiently. For example, in WSNs, nodes can be lo- 
cated in an open environment for years before data are collected, and hence 
requires nodes to carefully manage their energy usage. Additionally, a good 
routing protocol will leverage the resources of multiple nodes. For exam- 
ple, nodes may choose to shift some of their stored bundles to other nodes 
to free up memory or to reduce transmission cost. 
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Table 1: A sample of DTN applications. 
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To date, there are only two prior literature reviews |[T9l EDI . Our paper extends 
these works in the following manner: 

• We cover 42 new protocols. Specifically, there are 18 new unicast protocols 
as well as 10 multicast routing protocols. In addition, we review 14 routing 
approaches that incorporate either erasure or network coding in order to 
improve bundle delivery ratio and robustness. Note, both prior works do 
not cover multicast nor coding-based routing protocols. 

• We present extensive qualitative comparisons of all protocols. In particular, 
we categorize protocols according to their key feature and compare them in 
terms of their routing policies, resource constraints and requirements, ad- 
vantages and deficiencies. These are significant contributions as past works 
have only carried out a small number of comparisons using a limited set of 
criteria. 

• We highlight and summarize works that seek to borrow concepts from social 
networks and epidemiology in order to improve routing performance. These 
works are particularly critical to future researchers given the ubiquity of 
smart phones with increasing computation power and storage. 

• Lastly, we list future works that have not been identified elsewhere. One 
of which is a key limitation in regards to the research methodology used 
by existing works that hinders the objective and quantitative evaluation of 
routing protocols. Others include the development of multicast protocols 
and information retrieval systems for DTNs. 

The rest of the paper is structured as follows. Section [2] first presents an 
overview and taxonomy of routing protocols before delving into the details of 
epidemic routing protocols, and their variants, in Section 12.21 This is then fol- 
lowed by data ferries based protocols in Section [2731 That is, routing protocols 
that assume the existence of special nodes with ample resources and deterministic 
trajectories. After that in Section [2~4l we review protocols that dynamically main- 
tain historical information of past encounters to aid its future forwarding deci- 
sions. In Section [23l we review novel approaches that employ erasure or network 
coding. Then in Section [3l we review multicast routing protocols before review- 
ing research areas in social networks and epidemiology in Section HI Finally, in 
Section [51 we present our conclusions. 

2 Unicast Routing Protocols 

In the forthcoming sections, we will use the term bundle as the unit of exchange 
between nodes. Note that the IETF/IRTF defines bundle as the metadata that is 
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used for wrapping information from other layers and to compress them into a data 
block. Moreover, DTN routing protocols are independent of the delay tolerant 
networking architecture presented in 0. 

2.1 Overview 

Current DTN routing protocols can be divided into three categories: (i) epidemic, 
(ii) data ferries and (iii) statistical. Table [2] summarizes the key characteristics of 
each category. We see that the three categories have different features, advantages 
and disadvantages. 

Nodes in category (i) are assumed to have uniform resource and movement 
patterns. Moreover, they cooperate to route bundles for their neighbors. For ex- 
ample, in Figure [2l at t = 1, assume node B wants to forward a bundle to node-E. 
One approach that node-B can adopt is to simply forward the bundle to any nodes 
it encounters; i.e., floods the bundle as widely as possible. We see that the bundle 
arriving at node-E following the path B-C-A-E at t = 3. Unlike conventional 
networks, this path, however, is likely to change in subsequent bundle transmis- 
sions. In this respect, the key research objective, as we discussed in Section [2T2l 
is to design an efficient flooding based protocol that meets the following goals: a) 
message delivery rate is maximized, b) message latency is minimized, and c) the 
total resource, especially memory or energy expenditure, consumed is minimal. 
In general, epidemic routing protocols have low delays, but high resource con- 
sumption. Hence, designing a buffer management policy that balances resource 
consumption and delivery ratio is a fundamental problem. Moreover, said policy 
needs to purge staled bundles reliably. Otherwise, prematurely removing bundles 
may have a negative impact on delivery ratio. 
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Figure 2: A DTN with homogeneous nodes. A solid line shows connectivity 
between nodes. 

Routing protocols in category (ii) take advantage of resource rich, mobile 
nodes called Data Ferries (DFs), which act as a communication channel between 
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nodes or disparate networks. These networks could be located on different plan- 
ets or represent rural villages. Examples of DFs based DTNs are shown in Figure 
[3] We see four networks that are serviced by a DF (bus), which tours each net- 
work periodically. Note that one can reduce delays further by adding more DFs 
or buses. Each network, e.g., Net-Y, can be serviced by an independent DF. That 
is, instead of networks, a DF (car) is tasked with collecting data from nodes di- 
rectly. Apart from that, nodes can rely on wireless communications, i.e., Net-Z, 
and only use a DF for inter-cluster communications; see Section 12.3.51 for more 
details. The figure also illustrates two kinds of data ferry movements: active and 
passive. In the former, data ferries actively approach source or destination nodes; 
e.g., the bus. In the latter model, nodes intentionally move toward data ferries, 
as illustrated by nodes in "Net-W". In both types of data ferries, they have a pre- 
dictable schedule which a routing protocol can then exploit to provide some form 
of Quality of Service (QoS) guarantees. 




Figure 3: Example of DF based DTNs. 



The last category, i.e., (iii), of routing protocols utilizes statistical methods to 
avoid arbitrary flooding. The key assumption is that nodes in a DTN will always 
encounter one another. Moreover, nodes are homogeneous in that they have simi- 
lar resources. Hence, each node can compile a set of statistics or metrics regarding 
its rendezvous time with other nodes. For example, in Figure [2l we see that node 
A has a connectivity. That is, it meets other nodes frequently, and thereby, mak- 
ing it ideal as a bundle carrier. As we will elaborate in Section 12.41 nodes may 
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forward a bundle to a neighbor based on statistical information such as next hop 
forwarding probability. Other metrics include the number of times a node has en- 
countered a given node, and the duration in which a node remains connected to 
a given neighbor. Moreover, routing protocols may consider the storage capacity, 
energy, bandwidth and/or type of nodes. As a result, routing protocols in this cat- 
egory result in nodes with lower buffer occupancy consumption. The downside, 
however, is gathering invariant properties of a DTN which a node can then exploit 
to forward its bundles. 

In the next section, we first outline epidemic routing protocols and their op- 
timizations related to buffer management. Then in Section [2T3l we survey works 
that exploit and control special nodes or DFs. Lastly, in Section [2~4l we expound 
works that exploit mobility patterns. 

2.2 Epidemic 
2.2.1 Pure 

Vahdat et al. ||2T1| proposed the first epidemic routing protocol for DTNs, which is 
based on an epidemic algorithm originally developed for updating databases 11221 . 
Each node maintains a bundle summary vector describing each bundle's destina- 
tion, length and hop count. Whenever two nodes encounter each other, they begin 
an anti-entropy session, where they compare their bundle summary vector to as- 
certain missing bundles. Nodes stop their anti-entropy session when they have the 
same bundle summary vector; i.e., same set of bundles. Each bundle also contains 
a hop count that corresponds to its priorities, and is also used to constrain flooding. 
Apart from that, nodes allocate a dedicated buffer space for non-local bundles. In 
their experiments, the authors compared the delivery rate when nodes are allocated 
varying buffer sizes, and when bundles have different hop counts. Their results 
show that using an infinite buffer size results in the fastest message delivery, and 
achieves 100% bundle delivery with an average time of 147.3 seconds. With a 
buffer size of 1MB, 500KB and 200KB, they only observe up to 0.4% and 10% 
degradation in average delivery rate and maximum delivery times respectively. 
Interestingly, when the buffer size shrinks to 2.5% of its original capacity, the 
delivery rate reduces to 79.7%, and only 29.3% when nodes have a 10KB buffer 
size. Their results also show that protocol delivery rate is affected by bundles hop 
count. When hop count is changed from eight to one, delivery rate drops from 
100% to 80%. In general, their results show that buffer size has a non-negligible 
impact on bundle delivery ratio. Although these results confirm epidemic routing 
has good performance in DTNs, it suffers from the following problems. Firstly, 
in scenarios where nodes have a large summary vector or number of bundles, a 
short contact period may prevent nodes from exchanging their summary vector 
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Table 2: A comparison between DTNs routing protocols categories 
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successfully and complete an anti-entropy session. Secondly, nodes do not pref- 
erentially discard bundles during congestion. Thirdly, its performance is limited 
when nodes have memory constraints. 

2.2.2 Optimizations 

To date, researchers have proposed two main ways to reduce the memory con- 
sumption of nodes: anti-packets, limited copies, and metrics. 

Anti-packets 

An anti-packet is generated by a destination node once it has received a bundle 
successfully. In addition, anti-packets are used to eliminate duplicated bundles. 
That is, each anti-packet is paired with a bundle, analogous to "infection and 
vaccination" pair in epidemiology. Note, anti-packet is analogous to vaccination 
Il23ll . [|24l . immunity Il25ll or cure Il26ll . Hence, upon receiving an anti-packet, 
nodes check their bundle list and delete the corresponding bundle. 

The (p, q) -epidemic with vaccination routing protocol or PQERPV ll23l trans- 
mits bundles with varying probability and avoids duplicating bundles. Specifi- 
cally, nodes store bundles from their encounters with probability p and q, where 
p = q = means nodes do not store any bundles, and p = q = 1 indicates 
nodes are to receive all bundles. Nodes that are neither sources nor destinations 
are called relay nodes. Moreover, nodes that carry bundles on behalf of other 
nodes are termed "infected relay nodes". The value of p and q is set according to 
node types. Specifically, relay nodes store bundles from sources with probability q 
and store bundles from infected relay nodes with probability p. Destination nodes 
initiate a vaccination process once they have received a bundle correctly. Specifi- 
cally, they flood anti-packets to delete the corresponding bundle buffered at relay 
nodes. This protocol has the following limitations. Firstly, the flooding of anti- 
packets does not consider resource consumption. Secondly, the speed in which 
source nodes send new bundles is governed by the dissemination of anti-packets. 
In other words, source nodes will not send new bundles until they have the cor- 
responding anti-packets for bundles sent previously. In effort to reduce resource 
consumption, in [|37l . the authors combine PQERPV with an additional metric 
called Time-to-Live (TTL) to control and eliminate delivered bundles. Their re- 
sults show that when TTL is set to 10 minutes, nodes store on average 27% of 
transmitted bundles as compared to 19% when their TTL value is set to five min- 
utes. This is because nodes are more likely to remove expired bundles. 

Another protocol that utilizes anti-packets is Epidemic Routing Protocol with 
Immunity [|25l[|27l . This protocol delivers and drops bundles according to two 
lists: m-list and i-list. m-list is similar to the summary bundle vector in epidemic 
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routing protocol [21], where it records a bundle's ID and destination, i-list main- 
tains the bundle IDs that have arrived at their respective destination. When two 
nodes encounter each other, they combine their i-list and exchange those bundles 
that are not in their i-list. Each bundle that successfully arrives at its destination 
triggers nodes to update their i-list. A key concern, however, is that the size of 
i-list will increase following the successful delivery of bundles. Moreover, the 
authors did not specify any i-list management policies. 

Scheme for Epidemic Routing with Active Curing (SERAC) ll26ll proposes 
a faster anti-packet transmission and efficient buffer management scheme. The 
main idea, called active curing, is to prioritize the transmission of acknowledg- 
ment (ACK) messages, and thereby propagate them quicker throughout the net- 
work. Additionally, SERAC recalculates a new route when forwarding ACKs so 
that they follow the "best" path given the current network state. This, however, 
consumes more resources in terms of memory and CPU computation. Besides 
that, to minimize the size of ACKs, SERAC uses two bytes to represent the se- 
quence of bundles that has arrived at a destination. However, when bundles are 
fragmented or small in size, these two bytes overhead will be significant. 

Limited Copies 

Spyropoulos et al. 11281111301 propose the Spray and Wait routing protocol. Its rout- 
ing process can be split into two parts: (i) spray phase, where L copies of a bundle 
are initially forwarded by a source to L neighbors, and (ii) in the wait phase, these 
L neighbors relay a copy of the message only when they encounter the destina- 
tion. The same authors also introduce an improvement called Binary Spray and 
Wait scheme, in which each node transmits half of their bundles they have to any 
encountered nodes. For example, a source node with L = 10 will transmit five 
bundles to another node-A, and keeps five bundles for itself. This process is then 
repeated for any nodes that the source and node-A meet in the future. Their ex- 
periments involving 100 nodes show that with L increasing from five to 20 the de- 
livery delay decreases by approximately 42%. Under the same condition, Binary 
Spray and Wait has a higher performance, where delivery delay ranges from 3500 
to 1500 seconds. The main limitation with this protocol is that a maximum of two 
hops is used to deliver bundles. Hence, in large DTNs, a bundle may incur a sig- 
nificant delay as it can only be delivered when a relay or source node encounters 
the destination. In OTTl . Bulut et al. consider how bundle copies are distributed to 
relays given a time constraint. The main approach is to have a number of periods, 
each with increasing "urgency" corresponding to a bundle's deadline. Initially, a 
source sends out a small number of copies, and waits for an acknowledgment. If 
delivery fails, the source sprays additional copies to nodes that have yet to receive 
a copy of the bundle. Hence, with each passing period, more copies are generated 
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to ensure a bundle is delivered. The authors show via analytical and simulation 
studies that multiple periods reduce the number of bundle copies required to meet 
a given deadline. Their work, however, assumes acknowledgments are forthcom- 
ing in each period to facilitate bundle transmission. In addition, acknowledgments 
are forwarded using epidemic routing, which incurs high overheads. 

Energy is an important issue for DTNs comprising of battery constrained de- 
vices; e.g., sensors attached to animals 0. To this end, Li et al. |32] and Altman 
et al. 11331 study energy efficient forwarding policies for these types of DTNs. 
That is, limit bundle copies so that nodes incur minimal energy expenditure asso- 
ciated with transmission and reception. In [32], the authors consider a two-hop 
forwarding model, whereas the latter work also considers probabilistic epidemic 
forwarding. In both works, the goal is to seek to design policies that improve 
bundle delivery whilst adhering to a given energy budget or bounded transmission 
times. The key control parameter is the probability of transmission. For example, 
in two-hop forwarding, a source forwards a message to another node at time t 
with probability p(t). Using an extensive analytical framework, Li et al. showed 
that setting p(t) according to the following policy to be optimal in terms of en- 
ergy efficiency: given a time threshold t , set p(t) = 1 if t < t , and p(t) = 
otherwise. 

Metrics 

In this optimization, nodes compute a metric that reflects the current network state, 
such as the number of times a node has encountered another node or contact du- 
ration, in order to evaluate a neighbor's ability in delivering a bundle successfully. 

James et al. Il34ll developed an epidemic routing protocol that uses a buffer 
management strategy called Drop-Least-Encountered (DLE). The authors utilize 
Encountered Count (EC) as a metric to decide which bundles to drop when a 
node's buffer is full. Specifically, nodes drop bundles with the smallest EC. Apart 
from that, the value of EC degrades with time. For example, if two nodes meet 
only once, their EC value will decline from one to zero gradually. Additionally, 
nodes can also learn the EC metric of a neighboring node's past encounters. Con- 
sider node A, B and C. When node A encounters node C after meeting node B, 
node C can also learn the EC metric of node B and A. However, this protocol 
still uses flooding to transmit bundles, and it does not consider bundles priority. 
Consequently, high priority bundles may be dropped when a node's buffer is full. 

Prioritized Epidemic (PREP) routing protocol II35TI gives preference to bun- 
dles according to a cost value that is computed according to their respective des- 
tination, source and expiry time. In addition, PREP maintains a high replication 
density when bundles approach their destination. It consists of two main compo- 
nents: topology awareness and bundle drop/transmit processing. The topology 
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awareness component updates a link metric called Average Availability (AA), 
which is defined as -jr-, where T up is the total time when the link is up, Tj is 
the time when the link is available. A bigger AA value means a link has a higher 
utility and stability. PREP marks bundles that are further away from their destina- 
tion as low priority. This means nodes maintain a high bundle density as bundles 
approach their destinations. The main problem with PREP is that in highly dy- 
namic topologies, nodes will announce updated AA frequently, which slows route 
convergence. 

2.2.3 Discussion 

Table[3]summarizes epidemic based routing protocols. We see that since Vahdat et 
al. [EU's seminal work, a number of works have been proposed to address many of 
its limitations. They range from the use of probabilistic metrics to "intelligently" 
forward bundles to the removal of packets via anti-packets or expiration time. 
The main limitations of these protocols are that they assume all nodes have the 
same capabilities or properties, and consume a significant amount of resources. 
In the next section, we outline protocols that exploit resource rich nodes that have 
specific trajectories to help propagate bundles. 

2.3 Data Ferries 

The second category of routing protocols involves the use of Data Ferries (DFs). 
To date, there are 15 protocols that employ DFs. In general, any nodes can be DFs; 
viz. hand-held devices, vehicles, animals, and satellites. DFs follow a predictable 
or unpredictable pattern. For example, buses move periodically but with a period 
that ranges from a few minutes to several hours. On the other hand, wild animals 
have stochastic movement patterns. Lastly, a DTN can have one to hundreds of 

DFs mil ga. 

We will first review works that investigate the general benefits of DFs in Sec- 
tion 12.3.11 Following that, in Section I2.3.2L we present works that make use of 
vehicles as DFs. Then in Section [2.3.3L we survey works that use DFs to reduce 
the energy consumption of nodes. After that, in Section [2.3 .4[ we outline the use 
of DFs in deep space. Lastly, Section [2 . 3 . 5 1 review s works that consider group of 
nodes, each of which is served by one or more DFs. 

2.3.1 General 

The Message Ferrying (MF) PTTl scheme classifies MF movement and routing 
into two categories: (i) Node-Initiated MF (NEV1F), in which DFs move around a 
deployed area according to known routes and communicate with other nodes they 
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Table 3: A summary of epidemic routing protocols. 



Protocol 


Techniques to Minimize Flooding 


Key Feature 


Pure Epidemic 11211 


Transmit randomly 


Duplicate bundles until they are delivered 




Transmit according to probability 
p and q 


Use of anti-packets to erase duplicated bundles 


Epidemic with 
Immunity ll25l 


Transmit bundles randomly 


Use of anti-packets to erase duplicated bundles 


SERAC EH 


Prioritized ACK delivery 


Epidemic with DLE li34l 


Combines buffer management with encounter times 


PREPfl5H 


Relies on the Open Shortest Path 
Forwarding ll38l routing protocol 
to compute a least cost path 


Paths have a cost value, and bundles are forwarded along the path with the 
the least cost 


Epidemic with TTL 071 


Transmit according to a probability value 


Bundles have an expiration time that is determined by their TTL value 


Spray and Wait dHEa 


Limited copies. 


Bounded bundle copies. Two-hops delivery as relays only forward bundles 
directly to a destination. 


Stochastic 15211551 


Dynamic adjustment of forwarding 
probabilities 


Considers the energy expenditure of bundle transmission and replication. 



meet, and (ii) Ferry-Initiated MF (FIMF), in which DFs move pro-actively to meet 
nodes. The authors evaluate the performance of MF according to their data deliv- 
ery rate and energy consumption. Their results demonstrate that MF has a higher 
delivery rate as compared to epidemic routing protocol, and advantageously con- 
sume less energy. Moreover, MF transmits 30 times more data per Joule of energy 
than epidemic routing protocol. 

In 11391 , the authors investigate the relationship between the number of DFs and 
routing performance. Specifically, they report on the use of a single and multiple 
DFs. The used the following equation to calculate the delay incurred by a DF to 
collect data iflTTl : 

jj _ 52o<i,j<n w ijdjj 
2Jo<iJ<n w ij 

where n denotes the number of stationary nodes, iu is the data transmission rate, 
and d is the delivery delay of each data fragment. Note that the problem of com- 
puting the DF path that minimizes delay is effectively the well known traveling 
salesman problem. Their approach, however, has only been applied in DTNs with 
static nodes. 

In their subsequent work Il40l . the authors investigate the use of multiple DFs, 
and the following routing schemes: single route (SIRA), multiple routes (MURA), 
relaying between ferries indirectly (NRA), and directly (FRA). The DFs in SIRA 
travel on the same trajectory, but are on different trajectories in MURA. The dif- 
ference between NRA and FRA is whether stationary nodes are used as relay 
nodes between DFs. Their experimental results show that MURA is able to take 
advantage of routes with the least delay. Also, increasing the number of DFs is 
also beneficial for reducing the buffer consumption of DFs. Their experiment 
only considers the scenario where data ferries operate in semi- static topology. 
However, in high mobility scenarios, data ferries are required to track nodes, and 
coordinate the delivery process with each other - a task that incurs high signaling 
overheads. 

2.3.2 Vehicular Networks 

The authors of Il42l propose Mobility-Centric Data Dissemination Algorithm for 
Vehicle Network (MDDV), which they then apply to Vehicle-to- Vehicle (V2V) 
networks with the following characteristics: (1) predictable and high mobility, (2) 
dynamic and rapidly changing topology, (3) constrained and mostly one-direction 
movements, (4) potentially large-scale, (5) frequent disconnections, (6) vehicles 
that are not completely reliable, and (7) no significant power drain. MDDV ex- 
ploits the mobility of vehicles to deliver bundles and combines opportunistic and 
trajectory forwarding as vehicles only encounter each other occasionally, move 
along streets or roads, and are aware of their geographical location - as provided 
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by the Global Position System (GPS). Nodes using MDDV send bundles to neigh- 
boring nodes that are geographically closest to the region that contains the desti- 
nation. These nodes then flood the bundles upon entering the designated region. 
This research has two key issues. Firstly, there is no route recovery mechanism. 
That is, the authors have not considered the case when bundles fail to be deliv- 
ered due to data ferries running out of patrol. Secondly, the use of flooding is 
prohibitive when nodes have a high mobility. 

Another routing protocol is Meeting- Visit (MV) ll43l . Nodes build a geograph- 
ical information profile whenever they meet peers and visit different geographical 
locations. MV works as follows. When two nodes meet each other, they will 
first exchange a list of bundles they are carrying as well as their corresponding 
destination. In addition, they will annotate these bundles with a delivery proba- 
bility. After that, the pair of nodes will sort bundles according to their delivery 
probability. Bundles that are headed in the opposite direction will be allocated the 
lowest probability. They then select the top n bundles with the highest delivery 
probability. The delivery probability is calculated as follows: 

N 

Pn{i) = 1 - IK 1 - m jk PLi(i)) (2) 

where P%(i) denotes the success probability that node k transmit n bundles, N 
is the number of nodes, m jfc represents the probability that node j and k visit the 
same location simultaneously. 

MV also introduces four kinds of controllers: (i) total bandwidth, which chooses 
the peer which has not encountered other nodes for the longest period of time, (ii) 
unique bandwidth, which chooses the peer that has the largest number of messages 
not present anywhere else in the network, (iii) delivery latency, which chooses the 
peer whose average delivery time is the largest, (iv) peer latency, which chooses 
the location least recently visited by a peer. A multi-objective controller then al- 
locates a different priority to the aforementioned controllers. Their experiments 
show that although nodes experience higher latencies, MV can deliver messages 
with a high success ratio - i.e., 83% of the maximum achievable delivery rate 
with minimal duplicated bundles. However, their experiments show that MV has 
marginal performance as delivery rate does not improve significantly with increas- 
ing node density. 

Zarafshan-Araki et al. lfT6l propose TrainNet, a system that uses trains as DFs 
to transport massive amount of data between stations on a single railway track. A 
key problem observed is storage optimization, where the storage capacity of hard 
disks being loaded and unloaded from each station becomes a bottleneck given 
that trains only stop for a relatively short period of time at each station. They 
propose four scheduling algorithms to manage the said hard disks: (i) First Come 
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First Serve (FCFS), in which data is loaded onto trains according to their arrival 
time; (ii) Local Max-Min Fair Algorithm (LMMF), in which the farthest stations 
are allocated more capacities on train's hard disks; (iii) Global Max-Min Fair 
(GMMF), in which equal storage capacities are allocated to downstream stations; 
(iv) Weighted Global Max-Min Fair (WGMMF), in which data is prioritized by 
weight. Their results show that WGMMF has the best hard disk utilization. Note 
that WGMMF also has the highest average throughput, which is significantly more 
than GMMF. This work suffers from the following limitations: (i) intermediate 
stations cannot be used for storing data, (ii) the protocol does not support train-to- 
train communication, and (iii) the work only considers trains travelling in a single 
railway track. 

2.3.3 Energy 

Another key issue in DTNs is the energy constraint of nodes, which has a signif- 
icant impact on a node's ability to deliver bundles. Zhu et al. [44] present two 
algorithms, called Least Energy Tree (LET) and Minimum Hop Tree (MHT), to 
provide energy efficient message ferrying in wireless sensor networks. Both LET 
and MHT are based on the spanning-tree algorithm. The algorithms construct a 
spanning tree rooted at each node. For LET, each branch of the tree is allocated a 
weight corresponding to the energy needed to deliver a bundle. On the other hand, 
all branches are set to one for MHT. Note that, the spanning tree needs to be re- 
computed if there are topological changes. Their results show that LET consume 
slightly less energy than MHT. In addition, increasing the number of nodes also 
has an impact on the energy consumed by mobile nodes. 

The authors of R51l consider conserving the energy of DFs. They propose 
ferry replacement protocols where different nodes take turns to be DFs. The first 
protocol requires the current ferry to designate a successor that takes over its func- 
tionality upon failure. Specifically, it appoints the first node that it meets that has 
a higher capability than a given threshold. In the second protocol, nodes conduct 
an election. Each node computes a backoff delay according to their capability. 
Nodes with a shorter backoff delay will become the next DF. Their results show 
15% reduction in overheads when data ferries have a nominated successor. How- 
ever, in the second protocol, nodes spend more than 1000 seconds electing the 
next ferry, which delays bundle delivery by at least 2000 seconds. However, the 
protocol can fail to designate or elect a successor in two scenarios: (1) when all 
DF candidates have the same backoff delay, and (2) the capability of the current 
DF falling below a given threshold before it is able to appoint a new DF. 

In ll46l . DFs, also known as MULEs, are used to provide cost-effective con- 
nectivity in sparse sensor networks and to reduce the power requirement of sen- 
sors. The authors propose a three tier architecture comprising of access points 



18 



(APs), MULEs and sensor nodes. APs are connected to the Internet, and are used 
for storing and analyzing data from sensor nodes. MULEs have large storage 
capacities, renewable power and connect asynchronously to the APs and sensor 
nodes. In addition, they can communicate with each other. The authors proved 
that i) a high density of MULEs improves system robustness, and ii) sensor buffer 
requirements are inversely proportional to the number of MULE nodes. Specif- 
ically, when the authors increase the percentage of MULEs from 0.1% to 10%, 
each with an infinite-buffer, they show a 99.5% reduction in buffer consumption. 
Nevertheless, to guarantee 100% delivery, their scheme requires duplicated bun- 
dles. 

2.3.4 Space Communications 

The advantages of using DFs in deep space networks are as follows H71l . Firstly, 
DFs cut down the long distance between two planets into relatively shorter ones, 
which allows each segment of the route to have a high data rate. Secondly, all 
relays share the energy expenditure of transmitting data on a given route. Lastly, 
DTNs provide better connectivity as other planetary objects can be used as re- 
lays. For example, one can set the moon to be a DF which helps relay messages 
when Mars is not visible from Earth. In particular, these messages can contain 
commands that control a spacecraft. 

Henceforth, researchers have devised numerous routing protocols for Inter- 
Planetary Network (IPN). The Interrogation-Based Relay Routing (IBRR) [|48l 
protocol is designed for free space data transmission between Low Earth Orbit 
(LEO) satellites. When two satellites are in the vicinity of each other, they engage 
in an interrogation process to exchange orbital information and routing tables. 
Each satellite will then decide the next best hop according to the following met- 
rics: 

• Spatial location and orbital information 

• Link bandwidth 

• Relative velocity /mobility 

• Vicinity of a satellite to other satellites and ground stations 

• Memory capacity 

• Data rate 

IBRR also provides multipath routing. If there are more than one disjoint 
route to the destination, the source stores all these routes in its cache and selects 
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the best available route. Upon a path failure, it selects an alternate path and routing 
resumes. However, there are several problems: (a) the authors have not compared 
the impact of the aforementioned metrics. They assume that if one node has a 
lower bandwidth but with a higher mobility, nodes will not be able to decide which 
metric is best, (b) the protocol does not provide a route when a destination is more 
than one hop away, and (c) each satellite does not consider whether a neighbor 
has any success in delivering bundles to a given destination 

2.3.5 Inter-Cluster Communications 

To date, there are several cluster-based routing protocols. In [49], nodes in a 
cluster employ the Destination Sequenced Distance Vector (DSDV) If50l routing 
protocol, whereas a single DF is used to provide a communication channel be- 
tween clusters. Each cluster has multiple gateway nodes that transmit and receive 
bundles to/from a DF. Nodes in a cluster send their bundles to gateway nodes us- 
ing one of the following transmission policies: (a) random - nodes uniformly pick 
and transmit their bundles to a gateway, (b) proportional - nodes send bundles to 
the gateway with the highest storage capacity, and (c) nearest - nodes choose the 
gateway with the shortest hop count. Upon meeting a gateway, a DF first transmits 
any outgoing traffic before accepting incoming traffic or vice- versa. Alternatively, 
it could transmit bundles in a round-robin manner. 

Mobile Relay Protocol (MRP) lf5TTl combines the advantages of traditional 
and DTN routing protocols. MRP assumes that every node is mobile, and nodes 
are connected by a traditional routing protocol, i.e., DSDV [|50l . and also form a 
virtual cluster. Relay nodes store bundles until they can be transmitted; i.e., they 
meet a node with a valid route to the destination. If traditional routing fails to 
find a route, bundles are handed over to the DTN layer. Once relay nodes find a 
valid route, it reverts back to conventional IP routing. The authors evaluated the 
delivery rate and latency of MRP using three mobility models: (i) random - each 
node selects its destination randomly within an allowed area, (ii) soccer player - 
a node has a higher probability of picking a nearby destination, and (iii) homing 
pigeon - each node has a "home" location, and its speed is distributed uniformly 
between and 1 m/s, and nodes choose a random point as the destination with a 
probability of \, where r is the distance from the "home" to a given destination 
point, before moving back to its "home" base. In each mobility model, MRP is 
able to deliver over 95% of the bundles. Nodes experience the best latency when 
they move according to the homing pigeon model. In addition, they show that 
MRP is unaffected by node density. There are two problems with MRP. Firstly, 
transmission between clusters is inefficient because bundles are only handed to 
their respective destination when a DF arrives at the corresponding cluster. Hence, 
bundles experience variable delays. Secondly, as MRP relies on DSDV, it is not 
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suitable for sparse DTNs that have no contemporaneous paths between nodes. 

In Il52l . the authors propose a protocol that consists of the following messen- 
gers/DFs: regional and independent. A regional messenger only carries bundles 
for the region it belongs to, while the latter one is used for delivering bundles to 
all regions. The authors also propose the following strategies: (i) periodic - DFs 
depart regions according to pre-determined schedules, (ii) on-demand - DFs will 
leave a region as soon as any nodes require transmission, and (iii) storage-based 
- DFs leave its region once its buffer is full. Their results show that in terms of 
average delay, on-demand outperforms storage-based by 50%. On the other hand, 
the storage-based scheme performs best in terms of transmission cost. That is, in 
the storage-based scheme, DFs travel j^ th of the trips required by the on-demand 
scheme. However, the problem is that the delivery strategies of DFs are fixed and 
are not adaptive to changing network parameters. We postulate that combining 
all three strategies to be a worthwhile approach. That is, a DF can modify its 
movement based on memory utilization, transmission requirements from nodes, 
and allocated paths. 

The Context-Aware Routing (CAR) [53:] protocol elects nodes as DFs if they 
have a high number of connections and energy within a cloud or a given geograph- 
ical area. These DFs are then responsible for ferrying messages between clouds. 
Their experiments compare CAR's performance with pure flooding and epidemic 
ll2TTl . Their results show that CAR achieves a 70% delivery rate - assuming nodes 
move according to the random way -point model. This is 10% higher than pure 
flooding, but 10% lower than epidemic. However, their experiments only involve 
one cloud. Therefore, the performance of CAR in multi-clouds scenarios is un- 
clear. Moreover, the authors did not specify any protocols for communications 
between DFs. 

2.3.6 Discussion 

Table ID summarizes the key features of the aforementioned DF-based routing pro- 
tocols. These protocols can be characterized by (i) the type of nodes that are des- 
ignated as DFs, (ii) routes taken by DFs, and (iii) the number of DFs. In particular, 
we can see various DF types; e.g., buses and satellites. Apart from that, DFs have 
a variety of mobility patterns, and have ample resources. Moreover, depending on 
the application, multiple DFs can be deployed to promote better energy efficiency, 
delivery rate, and reliability. The fundamental assumption of these protocols is 
that there exists one or more nodes/DFs with well known properties. However, 
this assumption is not valid in DTNs where all nodes have pseudo-random move- 
ments; e.g., humans. In the next section, we present protocols that dynamically 
learn the invariant properties of a given network in order to route bundles. 
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Table 4: A summary of data ferrying routing protocols. 



Protocol 


DF Type 


Nodes Movement 


Ferry type (NI/FI) 


Number of DFs 


Target Applications 


MF [41] 




Random waypoint 








MF with single 








1 




ferrying node |39l 




Stationary nodes 








MF with multiple 




with deterministic 


FI 






ferrying nodes BUI 


Designated nodes 


DFs movement 






Generic 






Nodes periodically 








MV 1431 




visit known 
locations 


FI 


> 1 


Exchange of personal data 




Nodes with 


On demand 










large storage 


movements between 


FI 






MULE EH 


capacities and 
renewable energy 


sensors and APs 
between sensors 






Tracking, and 


DF and body 










monitoring 


sensor IPT51 


Sensor 


As per human 
movement 


FI 






Energy-efficient 




As dictated by a 




> 1 




MFIB41 




minimum 
spanning tree 








MDDV (421 


Vehicles 


According to 
vehicle routes 


Both 




V2V 


Ferry replacement 


Wireless 


Nodes move 


NI 


Id) 


WSN 


protocol |021 


nodes 


along routes 








IBRR [48] 


Satellites 


Geosynchronous 




> 1 




DSDV [49| 






FI 


1 


Inter-planetary 


MRP eh 










Internet 


Combined DF 


Wireless 


Move among 




> 1 




scheme ll52l 


nodes 


groups or clusters 








CAR [El 






Both 


1(1) 


General 

communications 






Between stations 






Data backup and 






according to track 






connectivity to 


TrainNet HSl 


Train 


layout and 
timetable 


FI 


1 


rural areas 


(1) with load balancing, multiple nodes g 


et to become a DF 









2.4 Statistical 



Proactive routing protocols rely on past or historical information such as tempo- 
ral, spatial and nodes performance. We will first outline works, see Section 12.4. 11 
that use temporal information before discussing those that exploit the spatial loca- 
tion and trajectory of nodes in Section 12.4.21 Lastly, in Section [2.4.3L we present 
routing protocols that seek out nodes with good delivery ratios. 

2.4.1 Temporal 

In this category, a time-related function is used to aid forwarding decision. For 
example, the authors of ll54ll use the encounter times of nodes. This information 
is then used by nodes when forwarding bundles, where they preferentially select 
nodes with the smallest interval between rendezvous periods. In IfTTTl . Jain et al. 
assume the availability of future contact periods. They designed a metric called 
Minimum Expected Delay (MED) and modified Dijkstra algorithm to compute 
the path with minimal end-to-end delay. As pointed out by Jones et al. 11551 . MED 
is only suitable for certain types of DTNs where future contact times are available, 
e.g., satellites. To address this limitation, they proposed a new metric called Min- 
imum Estimated Expected Delay (MEED), which is calculated using past contact 
history. Each node then floods this metric throughout a DTN using a link-state 
routing protocol, which unfortunately, incurs a significant amount of overheads. 
Moreover, it is unclear whether such an approach works in sparse DTNs. An- 
other example is Probabilistic Routing Protocol using History of Encounters and 
Transitivity (PROPHET) Il56ll . which uses a delivery predictability metric P. The 
metric has three features. Firstly, P is computed iteratively using prior values. 
Specifically, the P value of node A to B is defined as, 

P ( a, b) = P {a ,b) old + (1 - P(a,b) old ) X P mit (3) 

where P ini t is the initial value, and P( a ,b)oid is the old value. Secondly, the value 
of P degrades if nodes do not encounter each other within a given time interval. 
In other words, 

P(a,b) = P(a,b) old X J K (4) 

Here, 7 G [0, 1] is a fixed constant, and K is the time units that have elapsed since 
the last update. Lastly, the metric P can be computed using those from other 
nodes. That is, if node A and B first meet each other, followed by node B meeting 
C, then 

P(a,c) = P(a,c) M + (1 - P(a,c) old ) X P (0j6) X P (6jC) X (5 (5) 

where < /3 < 1 is set manually to control the impact of transitivity. 
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In (571, the authors propose PROPHET+, where a "deliverability" value, which 
is used to determine an appropriate routing path, is based on the following time- 
varying parameters: buffer size, power, bandwidth, and popularity. Note that, 
popularity is defined as 1 — -j^-, where N t and M t are the transmission rate and 
M t node's capacity in a given time interval respectively. Their results show 
PROPHET + is able to reduce packet loss and delays because it only transmits 
bundles to nodes that have ample storage and power. However, as pointed out by 
the authors, PROPHET + does not consider frequency of encounters. 

The authors of ll58l showed via experimental studies that cumulative contact 
characteristics, e.g., probability of node contacts, do not capture transient contact 
patterns and connectivity. For example, students at a university may meet for a 
prolonged period during class, and they may have a different set of friends when 
they are at home. Any routing decision must therefore identify the correct subset 
of nodes when delivering bundles. Moreover, when students are in class, they may 
form a connected network. These students are said to have indirect contacts, and 
hence, present an ideal setting for bundle dissemination. To this end, the authors 
propose a new metric that captures both direct and transient contacts within a 
fixed time interval. From trace-based simulation studies, they showed protocols 
achieved up to 50% improvement in bundle delivery ratio. 

In 11591 . the authors propose a protocol that is analogous to heat transfer. 
Specifically, the delivery probability from nodes to sinks corresponds to the ex- 
change of heat where nodes with a higher temperature transfer bundles to those 
with a lower temperature. The temperature of sinks is a constant T, which is set 
to a much higher value than other nodes. When nodes pass by a sink, they will 
be "heated", and hence, nodes with higher temperature has a higher probability 
of receiving bundles from other nodes. The temperature of each node depends 
on their mobility and the frequency in which they visit sink nodes. When nodes 
encounter each other, the one with the higher temperature will decrease in value, 
and vice-versa. This means nodes with a lower temperature will send bundles to 
those with a higher temperature as they have recently visited a sink. The authors 
deploy five nodes that move according to the random waypoint mobility model. 
Their results show that nodes that are geographically closer to the sink have a 
higher delivery probability. However, the protocol has only been tested in a small 
network with five nodes. 

2.4.2 Spatial 

The nodes in this category record the speed, direction and mobility pattern of 
other nodes. For example, Utility-based Distributed routing algorithm with Multi- 
copies (UDM) ll60l prioritizes nodes according to the number of connections a 
node has to their home communities. Here, "home communities" is defined as lo- 
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cations that nodes passed by and stayed close to most frequently. This means these 
nodes are more likely to deliver bundles destined to nodes in a given home com- 
munity. Apart from that, UDM uses binary transmission, where nodes send half 
of their bundles to another node as long as they have more than one bundle. Their 
experiments compare UDM with the epidemic and spray-and-wait routing proto- 
col in terms of delivery rate and average delay. Their results show that UDM de- 
creases nodes' average delay by 500 seconds when transmitting 50 bundles, which 
is half that of spray-and-wait and a third of the average delivery rate achieved by 
epidemic routing protocol. It is unclear whether similar results can be obtained 
for a different mobility model. 

Both lf6TTl and Il62ll propose to exploit movement vectors. They assume that 
nodes have position awareness but do not know each other's movement pattern. 
The authors of lloTI propose the Motion Vector (MoVe) routing protocol, which 
uses node velocity and angle to calculate the shortest distance to a given destina- 
tion. Specifically, when they encounter each other, they compare their trajectory, 
and bundles are forwarded to nodes that are headed to the corresponding destina- 
tion. Their results, from experiments comprising of 70 nodes with fixed source 
and destination nodes, show that increasing node numbers improves delivery rate 
and delay. Their experiments, however, do not consider the impact of duplicated 
bundles. 

Vector Routing (VeRo) Il62l uses the trajectory of nodes when forwarding bun- 
dles. Specifically, nodes record their position and angle changes, and preferen- 
tially exchange a bundle with a node that is moving away from it. The main lim- 
itation of VeRo is that stationary nodes tend to receive the most bundles because 
every other nodes is effectively moving away from them. In a different work, the 
authors of Similarity Degree-based Mobility Pattern Aware Routing (SD-MPAR) 
ll63Tl assume that nodes with the same mobility patterns will tend to have simi- 
lar movement angle and shorter distance between them; i.e., have stable relative 
positions. When two nodes encounter each other, they compare their similarity 
degree, which is a function related to the angle and distance between them. Nodes 
transmit bundles to those with a higher similarity degree. Their results show com- 
munication range to have an impact on delivery rate and average delay. Moreover, 
their results show nodes that have a higher similarity leads to better delivery prob- 
ability. This, however, is at the expense of additional computational and storage 
capabilities. 

The authors of ll64l exploit the mobility patterns of nodes when making rout- 
ing decisions. For example, a node will prefer to route bundles to nodes that are 
closer or headed toward the destination. They propose to identify mobility pat- 
terns according to four functions. That is, given the Cartesian coordinate of node 
% and j, (xi, t/i) and (xj,yj), they calculate their (i) Euclidean distance, (ii) Can- 
berra distance, (iii) Cosine angle separation, and (iv) matching distance, where 
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nodes are considered to have a similar distance if xj — xi < 9, yj — yi<6, where 
8 is a predefined value. Their results show that the Euclidean distance and Cosine 
angle separation function have the best performance in terms of average delay. As 
pointed out by the authors, the protocol can be improved further by incorporating 
temporal factors such as encounter duration or frequency. 

Lastly, Nelson et al. Il6*5ll propose an Encounter-Based Routing (EBR) proto- 
col. Every node maintains a metric that reflects the average number of contacts 
within a given time interval. This metric is then used to determine the number of 
bundle copies that is to be transmitted in each contact. Specifically, a node with 
a high contact value will receive more copies of a bundle because it has a better 
chance of propagating a message, which leads to a higher bundle delivery ratio. 
A key feature of the proposed routing protocol is that it bounds the maximum 
number of copies of a given bundle to L, where L is either a fixed or probabilistic 
value. This helps to keep resource consumption low, unlike works such as Max- 
Prop ll66l and RAPID ll67l that require a high resource consumption in order to 
achieve comparable delivery ratio. 

2.4.3 Stochastic 

The protocols in this category maintain a time varying network topology that is up- 
dated whenever nodes encounter each other. For example, nodes using the Short- 
est Expected Path Routing (SEPR) ll68Tl protocol maintain a stochastic model of 
the network. Each node constructs a time varying graph comprising of nodes they 
have encountered, and links that reflect the connection probability between nodes. 
This also means nodes will exchange a link probability table containing past en- 
counters whenever they meet other nodes. Applying the Dijkstra algorithm on this 
graph, each node then calculates the expected path length to a given destination. 
Their experiments demonstrate that SEPR has an improved delivery gain of 35% 
with a 50% reduction in resource consumption as compared to epidemic routing 
and its variants. The key limitation of SEPR is that it has poor scalability due to 
its reliance on the Dijkstra algorithm. Moreover, it is not suitable for DTNs with 
high node mobility. 

MaxProp Il66ll improves MV in the following manner. Firstly, MaxProp uses 
the hop count of bundles to better manage network resources. Secondly, acknowl- 
edgments are propagated through the network to erase expired bundles. Lastly, 
MaxProp deploys a mechanism to eliminate duplicated bundles. Similar to MV, 
MaxProp utilizes delivery probability during bundle transmission, in which each 
node initially defines their delivery probability as 
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where / is the initial delivery probability, \s\ is the number of nodes in the net- 
work. Upon encountering each other, nodes will update their delivery probability. 
For example, given node A and B that exist in a network with n nodes, where 
/a and fs represent their prior delivery rate, their new delivery rate f' A and f' B is 
calculated as f 1 

f*A = I'b = — 2 ( 7 ) 
E / + I a + 1 

n=l 

n-2 

where J2 f represents the sum of other nodes' delivery rate. Each node then 

n=l 

maintains a sorted list of nodes delivery rates. Bundles are then sent to nodes 
with the best delivery rate. Apart from that, MaxProp also employs the follow- 
ing prioritized bundle delivery schemes. First, all bundles destined to neighbors 
are transferred first, followed by routing information, which includes the proba- 
bility of meeting every other node. After that, nodes deliver ACKs, followed by 
bundles that have not traversed far in the network. Finally, nodes transmit the 
remaining packets and a hop-count list that reflects the list of nodes a bundle has 
already traversed. MaxProp is evaluated using a real-world testbed called UMass- 
DieselNet. The authors compared MaxProp to three routing algorithms. Namely, 
(i) random transmission, (ii) Dijkstra algorithm, and (iii) most Encountered/Drop 
Least Encountered algorithm (ME/DLE) - where bundles are transmitted to nodes 
that they encounter frequently, and are dropped if a bundle is destined for a node 
that the receiving node rarely meets. Their results show that bigger buffer size 
leads to an increased delivery rate. The key limitation of this protocol is that the 
initial delivery probability is a function of the number of nodes in the network. 
Unfortunately, without global knowledge, this probability is inaccurate. 

Context-aware Adaptive Routing in Delay Tolerant Mobile Sensor Networks 
(SCAR) [69] calculates the delivery probability of a neighboring node according 
to the number of (i) encounters it has with sink nodes, (ii) connectivity with other 
nodes, and (iii) battery capacity. In addition, SCAR is able to replicate a bundle 
to R nodes with the highest delivery probability. The authors, however, have not 
evaluated SCAR against any existing DTN routing protocols. 

Balasubramanian et al. Il67ll propose a routing protocol that considers network 
resources such as bandwidth and storage when optimizing a given route metric. 
This is especially critical when nodes have resource constraints. The said proto- 
col, called Resource Allocation Protocol for Intentional DTN (RAPID), considers 
the utility of replicating a bundle at each rendezvous. Here, the utility of a bun- 
dle models the benefits of replicating a bundle whilst taking into account resource 
constraints. They also propose a control channel that allows nodes to collect in- 
formation such as past transfers and encounters, which are then used to determine 
the utility — of a bundle i, where the numerator represents the increase in utility 
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if the bundle is replicated, and s, is the bundle size. Each node only forwards 
bundles with the highest utility amongst those in its buffer. RAPID's performance 
is sensitive to how much and frequent information is distributed over the control 
channel. Their trace-based studies show RAPID is able to deliver 88% of pack- 
ets with an average delay of 91 minutes. These results, however, are achieved 
over one set of traces fTOll . and it is unclear whether similar performance can be 
achieved over other traces and indeed theoretical models. 

2.4.4 Social Networks 

Another approach taken by researchers to improve the performance of routing pro- 
tocols is by identifying invariant properties in social networks. Briefly, informa- 
tion diffusion in social networks is a well studied problem and bears resemblance 
to data transmissions. As shown by Milgram in H106L individuals tend to be sep- 
arated by six degrees of separation. In his famous experiment, 60 letters destined 
to a stockbroker were handed to a different person that only knows the stockbro- 
ker's name. The study found that the median chain length of intermediate letter 
holders was approximately six. This experiments demonstrates the "small-world 
phenomenon", and the ability to propagate information in a seemingly random 
network. 

Henceforth, the authors of II 10711 propose a routing protocol that exploits the 
fact that nodes in the real world routinely move and stay in several well known 
places. This means nodes that visit these places have a contact probability and 
hence, will have more success delivering bundles. Another example is H108L 
where Daly et al. propose three relevant properties of information flow: central- 
ity, ties and predictors. Centrality reflects a node's degree of importance, where 
a node with good centrality has a robust relationship with other nodes in the net- 
works. In particular, centrality can be further measured by three sub-degrees: 
Freeman's degree, closeness and betweenness. Freeman's degree is the number 
of directly connected nodes. Closeness refers to the path length or hop count be- 
tween two nodes. Finally, betweenness corresponds to the number of times a node 
is used to relay bundles. The ties property evaluates the robustness of the connec- 
tion between two nodes, where nodes with the following properties are deemed to 
have a high delivery rate: frequently connect at the same location, long encounter 
duration, and are well known to other nodes. Lastly, the predictor property indi- 
cates whether two nodes are likely to be connected to each other. For example, as 
proven in H109H , based on an analysis of the evolution of scientific collaborations, 
two co-authors of previous works tend to have a high probability of working to- 
gether again in the future. In other words, previous connection is a good indicator 
of future ones. In another work H110H . Gao et al. consider user interests during 
routing. They too exploit centrality, where they seek to exploit relays with high 
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centrality in order to increase user satisfaction. In H111H . the authors consider 
nodes with high centrality as well as their contact duration with others. This is ad- 
vantageous as popular nodes may only have brief encounters with a high number 
of nodes, and thereby, have limited communication capacity. 

2.4.5 Discussion 

Table [5] summarizes the categories and metrics used by proactive routing proto- 
cols. To determine the next hop used for delivering bundles, protocols collect 
information such as time-varying metrics or geographical information. Note that 
the latter information is easily obtained when nodes are equipped with a GPS unit. 

Works that exploit properties of social networks are summarized in Table [6j 
We can see all routing schemes utilize the notion of groups or communities where 
nodes are classified according to their common locations or hobbies. Other than 
that, researchers also seek out popular nodes, as determined by their connectivity 
to other nodes. These nodes, therefore, serve as a "good" next-hop when forward- 
ing bundles. 



Table 5: A summary of proactive routing protocols 



Protocol 


Category 


Metric 


TIR [54] 


Temporal 


Encounter times 


PROPHET (56| 
PROPHET" 1 " [57] 
HEAT (53 


Time-varying delivery probability 


UDM [601 


Spatial 


Location visit frequency 


EBR[65] 


Encounter frequency 


MoVe ED 
VeRo 11621 


Movement vector 


SD-MPAR (631 


Similarity of mobility pattern 


MobySpace [64] 


Shortest distance to destination 


SEPRflU 


Stochastic 


Shortest path calculated by delivery 
probability 


SCARED 


Connectivity change with other nodes 
and remaining battery capacity 


RAPID [72] 


Network bandwidth and node storage 


MaxProp [66 1 


Delivery probability 
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Table 6: A summary of social networks research 



Studies 


Definition of 
a group 


Properties/Metrics 


Example 


11241 


Common 
location 


Connections with 
other nodes 


People that work in the same office 
or pass each other frequently 


|125l 


11261 


Selfishness and 
altruism 


H27fl 


Same hobbies 
and locations 


Social profile of 
encountered nodes 


Club members 


11281(951 


Social 
relationship 


Delivery probability 
and frequency to a 
community 


Family members or classmates or 
popular individuals/communities 


1129fl 


Locations 


Visit frequency 


People visiting the same shopping 
mall 


11101 


Mobile users 


User satisfcation 


Individuals with high centrality; 
e.g., postmen 



2.5 Coding 

A standard approach taken by protocols we have reviewed thus far to improve 
bundle delivery ratio or their robustness is to increase the number of redundant 
transmissions. The downside, however, is higher buffer occupancy level, which 
leads to packet loss. Moreover, nodes may not be able to transmit all buffered 
bundles in each encounter. To this end, a number of researchers have proposed 
to incorporate erasure coding, e.g., f73l 11741 . Briefly, given a bundle of size M 
and a replication factor r, an erasure coding algorithm produces M*r/b blocks 
of size b. The key feature is that any (1 + e) x ^ erasure coded blocks can 
be used to reconstruct the original bundle. In other words, a receiver is able to 
recover a bundle after receiving a fraction of M*r/b coded bundles. As a result, 
the dynamics of a DTN, such as uncertain contact periods and limited resources 
at nodes, have a lesser impact on bundle delivery. As we showed later, this key 
characteristic increases the probability of bundle delivery whilst consuming less 
resources as compared to protocols that simply increase the number of duplicated 
bundles. 

Wang et al. 117511 proposed a novel forwarding method where coded-blocks are 
distributed widely to a number of nodes. Advantageously, it limits each node to 
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only a fraction of the coded blocks, which has the effect of constraining routing 
overheads. The method works as follows. Given a replication factor r, a source 
first divides a message into k blocks, which are then coded into k x r coded blocks. 
It then divides these blocks equally amongst kx r relays, where A; is a constant. A 
receiver recovers a bundle completely after receiving - of the coded blocks. That 
is, a receiver re-constructs a message after meeting k relays. The authors showed 
via simulation and analytical studies that the use of erasure codes improves the 
worst case delay with a constant overhead. In particular, the distribution of coded 
messages to a wide variety of relays help mask problems with a subset of these 
relays. Moreover, unlike conventional routing protocols, the proposed method has 
constant overhead as it does not need to increase bundle redundancies to achieve 
comparable delivery ratios. 

A fundamental problem in using erasure codes is determining the fraction of 
coded messages that should be transmitted amongst N paths such that the delivery 
probability is maximized. This problem is made more difficult by the fact that 
each path has a different failure probability and throughput that are dependent on 
node capability and mobility. Jain et al. If76l studied this problem over two cases. 
The first is where the N paths are governed by two models: Bernoulli and Gaus- 
sian. The former model represents the scenario where a message is either fully lost 
or delivered, whereas in the latter, only a fraction of the messages are delivered. 
To cope with the dynamic nature of DTN paths, the authors propose an optimiza- 
tion framework, and showed the Bernoulli case to be NP-hard. Specifically, they 
propose a Mixed Integer Program (MIP) for the Bernoulli case, and adopt tech- 
niques from modern portfolio theory 117711 to attack the Gaussian case. A limitation 
of this work is the use of a fixed-rate erasure code. In this respect, the authors have 
not provided any methods to determine the optimal rate. A key assumption in this 
work is that the availability of N paths that meet a given delay constraint, each of 
which has a well-known and constant probability over time. In a different work, 
Tang et al. lT78l addressed this limitation by considering a set of paths, each with 
varying stability. They proposed to interleave coded blocks before transmitting 
them on available paths. Their experimental results show sources using interleav- 
ing obtaining up to 30% higher packet delivery ratio. An interesting result is that 
delivery performance does not improve with increasing number of paths as coded 
blocks are more likely to be forwarded onto poor paths. To address this limitation, 
the authors propose to send a different number of coded blocks on each available 
path as a future work. 

In 11791 , Liao et al. used the contact frequency between a relay and destina- 
tion node in order to decide the proportion of packets that are exchanged in each 
encounter. In a subsequent work 11801 . they extended their protocol to consider 
available resources at a given relay. This is an important consideration as a relay 
with a high probability of meeting a destination may be low in power or buffer 
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space. The authors showed via simulation that the delivery ratio of the proposed 
protocols perform better than Spray-and-Wait lf8~TI and Wang et al. B751's method. 
A caveat of these protocols, however, is that nodes need to learn the contact fre- 
quency of nodes. Similarly, the authors of [82J employ erasure coding to help 
improve delivery with minimum overhead in a delay or fault tolerant mobile sen- 
sor network. Specifically, their protocol determines the optimal erasure coding 
parameters such as the number of data blocks and redundancy given the current 
delivery probability. 

Wang et al. [76] and Jain et al. Il76ll have spurred a number of studies on us- 
ing coding in DTNs. In [83], Altman et al. provided a closed-form solution that 
allows one to compute the delay and energy consumption of a DTN as a func- 
tion of the coding used. In this case, the authors considered erasure codes such 
as Reed-Solomon, and rate-less fountain codes. The main limitation, however, 
is that they only consider two-hops routing schemes, where a source transmits a 
bundle to all encountered nodes, and relays only forward to destination nodes. In 
(841, the authors argue that coding based schemes are in-efficient in DTNs with 
good connectivity because of the additional information carried in coded blocks. 
Moreover, as each relay only receives a limited number of coded blocks, exist- 
ing approaches waste the remaining time of a given contact. To this end, they 
propose two protocols: (i) Aggressive Erasure Code (A-EC), where a node trans- 
mits as many coded blocks as possible in every contact duration, and (ii) Hybrid 
Erasure Code (H-EC), where a node transmits two copies of a coded block. In 
other words, the first copy is sent as per 117511 . and the second copy is transmit- 
ted using A-EC. A disadvantage of both protocols is their higher traffic overhead. 
In a different work, Vellambi et al. 11851 use rateless codes to improve delivery 
reliability. In particular, they aim to overcome the negative impact of discarding 
expired bundles, a standard approach used to reduce buffer occupancy level. Un- 
like [17611 . which used a fixed-rate erasure code, the authors consider rate-less or 
fountain codes; see ll86l . Their simulation studies over Random Waypoint Model 
and UMassDieselNet dataset [ref] show better performance over replication based 
protocols, and the method proposed by Wang et al. Il75l . An interesting finding is 
that reliability and latency degrade gracefully with decreasing expiry time. 

Bulut et al. [[871 study the cost of erasure coding, where cost is defined as the 
total bytes transmitted between a source and destination pair. In particular, they 
study the cost incurred by the following spraying algorithms: binary and source 
lISTI . They found messages are distributed more slowly when nodes use source 
as compared to binary spraying. However, binary spraying incurs high overheads. 
To this end, the authors designed a protocol that adjusts the replication factor r 
and number of blocks k, see [17511 . dynamically based on two factors: (i) a node's 
probability of meeting a destination node within a given deadline, and (ii) the 
required delivery ratio. In addition, they consider spraying messages at different 
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periods, each with its own replication factor that is a function of the number of 
messages that has arrived at the destination in the last period. Their method, 
however, requires nodes' contact history and relies on acknowledgments from a 
destination node, which may not be forthcoming. 

In [|75l . Widmer et al. studied DTNs where no oracles are available to pro- 
vide topological information. In this type of DTNs, conventional approaches tend 
to employ probabilistic routing protocols; see Section IZ4l The authors improve 
upon these protocols using network coding [88]. Consider the following simple 
example comprising of nodes placed in a linear topology: A-B-C. Assume node A 
transmits packet a to C, and conversely, node C sends packet c to node A. When 
node B gets both packets, it broadcasts packet b, where b = a (B c. Node A can 
then retrieve packet c as follows: c = b © a. A similar operation is carried out 
by node C to retrieve packet a. It is worthwhile pointing out that unlike conven- 
tional mobile ad-hoc networks, e.g., [[89], the probability of having multiple nodes 
within the transmission range of a node is exceedingly small in DTNs. Widmer 
et al. adopt linear network coding Il90ll , where a node sends a linear combination 
Y,i giXi, where Xi is the packet of interest. Here Xi and ^ are interpreted over 
a finite field. A nice property of linear network coding is that n packets can be 
decoded if a node receives m combinations, and m has rank n. Effectively, these 
combinations can be viewed as independent equations which form a system of 
linear equations that can then be inverted to reveal the n packets. To this end, the 
authors designed a scheme that ensures a sufficient number of combinations are 
propagated by each node such that packets can be decoded with a high probability 
whilst keeping memory requirement at a minimum. More specifically, for a given 
forwarding factor d, a node generates [d\ information vectors, and an additional 
vector is generated with probability d - \d\. From extensive simulation results, 
the authors showed that the proposed scheme performs better than probabilistic 
algorithms in terms of packet delivery in extreme conditions. Moreover, they ob- 
served that the proposed algorithm benefited from node mobility. A key future 
work is extending their scheme to consider the case where non sink nodes are not 
required to decode data. 

In 11911 , the authors study the impact of Random Linear Coding (RLC) on the 
delivery delay of epidemic routing protocols subjected to bandwidth and buffer 
constraints. At each encounter, nodes exchange their encoding matrix to deter- 
mine whether any combinations will lead to an improvement in rank. If so, these 
combinations and random coefficients are then exchanged. Moreover, an anti- 
packet is generated once a node decodes all packets. The authors studied two 
cases: (i) a single block of packets destined to one or more receivers, and (ii) mul- 
tiple coded blocks. They showed that RLC improves the delivery and delay per- 
formance of epidemic routing protocols in both bandwidth and buffer constrained 
cases - a similar conclusion was also put forth by Lin et al. Il9"2ll . In lIMTl . the 
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authors showed via an analytical framework that network coding yields a signif- 
icant advantage in terms of reducing delay. In addition, they proposed a priority 
coding protocol that classifies K packets into different priority levels. Each level 
is then coded and transmitted. Once an ACK is received by the source, it codes 
and transmits packets in the next priority level. 

Ahmed et al. [|93l exploit the power law behavior observed in DTNs compris- 
ing of humans carrying personal communication devices (941 . Another example 
is vehicle based DTNs [66]. This means a small number of nodes have a high 
degree of connectivity. In other words, they make good relays or hubs as all other 
nodes are reachable via them over a small number of hops. To this end, Ahmed 
et al. designed a protocol whereby nodes forward bundles to hubs, which then 
construct a linear combination of these messages to reveal an encoded message. 
This is carried out to take into account the bounded bandwidth of each hub. Hubs 
exchange coded messages that increase their matrix rank whenever they meet. 
The resulting protocol is shown to have 20% higher delivery ratio as compared 
to Epidemic EH, Spray-and-Wait ED. RLC (9TJ, and BubbleRap (951 in a DTN 
comprising of 1163 buses operating in Seattle Il96l . 

2.5.1 Discussion 

Existing works can be divided into source and network coding based approaches. 
The main problem with source based approaches is that intermediate or relay 
nodes may waste bandwidth transmitting duplicated coded packets. A key dis- 
advantage of both approaches is decoding delay, which thus far, only Lin et al. 
[|97ll have addressed. Prior works can also be distinguished by their assumptions 
on DTN dynamics. That is, whether they consider prior knowledge in terms of 
contact duration and times between nodes, and whether a source is aware of the 
delivery probability at a destination. Also, coding approaches have primarily been 
evaluated in the context of epidemic based routing protocols. Therefore, it will be 
interesting to investigate their performance in DTNs that use protocols outlined 
in Section 12.31 and 12.41 Indeed, a key future work will be to evaluate their per- 
formance in conjunction with mechanisms such as the use of TTL, EC and anti- 
packets. Lastly, apart from Il871l and (791 , little work has considered adjusting the 
parameters r and k dynamically in accordance with the dynamic nature of a DTN. 

3 Multicast 

Applications may need to deliver bundles to a group of users. For example, the 
dissemination of software patches and targeted advertisements. Supporting mul- 
ticast in DTNs is non trivial as there are frequent link partitions and nodes experi- 
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encing unpredictable transmission delays. This means it is unlikely that nodes or 
subscribers will receive bundles at the same time, nor for source(s) to receive all 
acknowledgments. In addition, group membership changes, varying node speeds 
and density further add to the complexity of designing multicast protocols that 
run well in DTNs ll98l . Apart from that, multicast poses a fundamental problem 
to statistical protocols, see Section [2~4l as existing methods select relays based on 
their delivery probability to one destination only. Multicast protocols for DTNs 
can be classified into two categories: (i) unicast-dependent, and (ii) statistical. 

3.1 Unicast Dependent 

Zhao et al. Il99l define three kinds of multicast receivers. The first is called Tem- 
poral Membership (TM), where nodes that are connected in a given time period 
are viewed as members belonging to the same group. The second is called Tem- 
poral Delivery Model (TD), which combines TM with a delivery threshold for 
group members. This means, within a given time period, nodes are considered 
group members if they have connectivity to each other and can transmit bundles 
to their destination within a delay threshold. The last one, called Current-Member 
Delivery Model (CMD), considers the stability of group members - i.e., nodes 
that remain static for the duration of a multicast session. Zhao et al. also proposed 
four fundamental multicast routing algorithms: 

1. Unicast-Based Routing (UBR) - where unicast is used to emulate multicast 
whereby a source node sends bundles via unicast to each group member. 

2. Static Tree-Based Routing (STBR) - a spanning tree is constructed by a 
source node, and the tree remains static for the duration of the multicast 
session. This means if the path connecting node A and B breaks, these 
nodes will not be able to communicate. 

3. Dynamic Tree-Based Routing (DTBR) - where each node and bundle are 
allocated a group ID, and bundles are only received by group members with 
the same group ID. DTBR chooses the shortest routing path to each group 
members using the Dijkstra algorithm. 

4. Group-Based Routing (GBR) - where nodes with contemporaneous paths 
are designated as a group, and a source multicast bundles to each group 
using STBR. Within a group, flooding is used to deliver bundles. 

All four categories of multicast routing protocols have several shortcomings. 
UBR requires ferries to meet every subscriber. This means UBR does not ade- 
quately utilize the mobility of all non-ferry nodes to help improve the delivery of 
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bundles. Both STBR and GBR have poor flexibility. In STBR, all routing paths 
are static, and if the path connecting node A and B breaks, both nodes will not be 
able to communicate. In GBR, bundles are delivered to a group member, which 
then floods the bundles to other group members. However, GBR still uses STBR 
to deliver bundles. As DTBR uses Dijkstra's algorithm, it requires a DTN to have 
contemporaneous paths between nodes. 

The authors of H100H investigate a variant of DTBR called On-demand Situation- 
aware Multicast (OS -Multicast). One significant difference is that nodes build a 
dynamic multicast tree according to their history of encounters. That is, nodes 
maintain encounter records and duration with other nodes, and use this informa- 
tion to calculate the shortest path to a node. The authors evaluate the performance 
of OS-Multicast, UBR and STBR in terms of message delivery rate, delivery effi- 
ciency and average delay. Their results show that OS-Multicast has better delivery 
rate and less delay. Moreover, OS-Multicast has a higher reliability as compared 
to DTBR as there are more duplicated bundles in the network. 

The authors of Context Aware Multicast Routing (CAMR) II 1 1 H propose a 
multicast routing scheme that is based on DTBR but with the addition of a route 
recovery mechanism. Nodes multicast bundles through a recently computed short- 
est path - as per the Dijkstra algorithm. Apart from that, nodes broadcast neighbor 
discovery messages and if the number of neighbors fall below a threshold, nodes 
set the "sparsely connected" flag and increase their transmission power to recruit 
more neighbors. In terms of delivery rate, CAMR delivers 1.3% more bundles as 
compared to DTBR. In addition, the authors reported a 23% decrease in average 
delay. However, the limitation of CAMR is that the threshold value is fixed and 
not adaptable to changes in node movements. 

A critical problem for all the aforementioned multicast schemes is the reliance 
on a spanning tree. However, in a network where topologies change frequently, 
nodes experience long disconnection times, and may have no acknowledgments 
from other nodes. Hence, generating and maintaining a spanning tree in such 
a network becomes impossible. In epidemic routing protocol and its variants, 
bundles transmissions are based only on node encounters, and does not rely on 
a spanning tree. Moreover, bundles are propagated at a higher rate, and hence 
nodes experience a better delivery ratio as all nodes participate in the forwarding 
process. This is in contrast to data-ferries based multicast routing schemes, where 
bundles are delivered by designated nodes. 

3.2 Statistical 

In H102H . Abdulla et al. argue that the aforementioned approaches are not readily 
applicable in DTNs due to the lack of knowledge regarding node connectivity 
and mobility. In other words, it is difficult to form a multicast tree that adapts to 
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the vagaries of DTNs. To this end, they propose Controlled Epidemic Routing 
for Multicast (CERM). Source nodes multicast bundles to all encountered nodes 
until they are received by all subscribers. The key problem is duplicated bundles. 
In this respect, the authors propose two ways to eliminate these bundles. First, 
they propose the use of synchronization servers, which keep track of TTL value 
of bundles. Once a bundle expires, these servers flood control messages to all 
nodes to purge the bundle. The second proposal involves embedding a TTL value 
in each bundle. Their results show that delivery rate is proportional to multicast 
group size and simulation time. The group size, however, has no impact on the 
average delay. This work, however, assumes all nodes have synchronized clocks. 
In addition, it is unclear how TTL can be adapted to the varying dynamics of a 
DTN. 

The fundamental problem addressed by Gao et al. H103B is to select the min- 
imum number of relays that ensures a given bundle delivery ratio to multicast 
receivers. Interestingly, this is effectively a variant of the well known knapsack 
problem, where each item k or relay has a weight it; fe . To calculate w^, the au- 
thors employ two key social characteristics, centrality and communities. That is, 
popular relays or those that are part of the same community as multicast sub- 
scribers have a higher weight. Using trace-based simulation, the proposed pro- 
tocols, Single-Data Multicast (SDM) and Multiple-Data Multicast (MDM), are 
shown to have 20% and 50% higher bundle delivery ratio respectively than pure 
epidemic ETTl and PROPHET Il56ll . and requires fewer relay nodes. 

In the DTN Pub/Sub Protocol (DPSP) H104H . there are two types of node: pub- 
lisher and subscriber. Nodes designated as publisher store and classify bundles, 
and flood them to subscribers. Each node maintains a subscription list to record 
the multicast group they belong to, and floods this list to all other nodes. If nodes 
receive a bundle destined to a group they have not subscribed to, they will store 
and forward the bundle to nodes that are subscribed to the corresponding group. 
Their results show that DPSP has a better delivery rate but worse delay than epi- 
demic routing. The limitation of DPSP is that nodes use flooding to announce 
their subscription. Hence, it is not scalable with increasing node numbers. 

In order to support multicast, the authors of H105II propose to use either epi- 
demic routing protocol or DFs depending on group numbers. Source nodes first 
send bundles to be multicast to DFs, which then decide, according to group size, 
whether to deliver these bundles one by one or to use the epidemic routing pro- 
tocol. In terms of delivery rate and delay, their experiments show the proposed 
protocol reaches a delivery rate of 75%. However, they also showed that the aver- 
age delay decreases when the group size increases. 
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3.3 Discussion 



Table U\ summarizes the mechanisms and metrics used by each of the aforemen- 
tioned multicast routing protocols. Additionally, we indicate their delivery rate 
and delay. Thus far, these protocols assume the ability to build a spanning tree. 
Hence, there are only applicable to DTNs with semi- static nodes. Moreover, little 
to no work has investigated the advantages and disadvantages of using epidemic 
and DFs routing protocols. We elaborate on these observations further in Section 



4 Future Research Areas 

In this section, we outline efforts that seek to adapt results from social networks 
and epidemiology in the hope of increasing the delivery ratio and minimize delay 
of bundles in DTNs that use humans as a method of transport. Then, in Section 
14.21 we briefly discuss the lack of uniform research methodologies that allow re- 
searchers to compare all routing protocols objectively. After that in Section 14.31 
we discuss potential research directions of multicast protocols. 

4.1 Epidemiology 

The field of epidemiology studies the spread of disease in scale-free or social- 
contact networks. In this respect, a promising research direction is to exploit 
similar concepts and models. As Table [8] shows, DTNs and epidemiology share 
key concepts that we can exploit to create data distribution protocols. By min- 
ing social networks for contact patterns, and learning their dynamic properties, 
we can formulate design guidelines that would lead to practical and efficient data 
dissemination protocols. Watts and Strogatz HI 121 studied the identification of 
individuals with the highest "centrality" - a measure of contacts, the proportion 
of time an individual lies on the shortest path between other individuals, and the 
average steps between an individual and all other individuals. This means nodes 
can monitor the centrality of encounter objects in the hope of identifying those 
that maximize delivery ratio as well as minimize delay. Another interesting work 
which researchers can exploit is a study by Brockmann et al.'s H113H . which uses a 
Web game, called Where's George? (http://www.wheresgeorge.com/), to track the 
geographical location of dollar bills, and from that derives statistical laws of hu- 
man travel. Here, "dollar bills" are analogous to nodes carrying messages. Hence, 
by adapting the models in H113II . researchers will be able to analyze the following 
properties: the frequency and duration of contact periods between nodes, packet 
delivery ratio, and message delivery delays. Another potentially relevant work is 
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Table 7: A summary of multicast routing protocols 



Protocols 


Multicast Mechanism 


Metrics 


Limitations 


Group 
Definition 


UBR (99) 


Calculates a separate routing path 


As per the cost used 


Consume significant resources 






from a source to each group members 


by unicast protocol 


if multicast group is large 


Nodes which 


riTrnri I Jilt | 

STBR [99 1 


Constructs a static spanning tree 


Bandwidth 


Static spanning tree is not 


require 






or transmission rate 


suited for dynamic DTNs 


the same bundles 


DTBR (SO 


Uses Dijkstra algorithm 


Transmission cost 

between nodes 
in the same group 


Bundles cannot be 
transmitted between groups 


Same group ID 


GBRII991 


Uses STBR between groups and 


Transmission cost 


Uses flooding within group, 






flooding within groups 


between groups 


which consumes 
significant amount of resources 




OS-Multicast 


Uses DTBR with historical 


As per node's 


High signaling overheads 


Nodes which 


||100| 


encounters 


destination list 


associated with 
multicast tree maintenance 


require the 
same bundles 


CERM|102| 


Employs epidemic routing, and is able 
to eliminate duplicated bundles 


Expiration time 


Difficult to synchronize 
all nodes m DTNs 
due to frequent disconnections 




DPSP ||104| 


Relies on flooding and group 


Subscription list 


Uses flooding to announce 


Nodes with the same 




member filtering 




nodes' subscription 


subscription that describes 
required bundles' categories 


CAMR [101 1 


Uses DTBR with a recovery 


Neighbor counts 


Does not work well in sparse 


Nodes in 




mechanism 




networks due to the cost of 
neighbor search 


the same location 


MEPDF B105I 


Combines epidemic routing and DFs, 


Group size 


Static threshold value 


Nodes that require 




and adaptively chooses suitable 




for group size 


the same bundles 




protocols according to group size 








SDM, MDM D1031 


Forward bundle to relays with 


Cummulative 


Relays must be in contact 


Nodes that require 




highest centrality and part of 


probability to 


with source by the given 


the same bundle 




a community with multicast 


multicast subscribers 


deadline 






subscribers. 









that of Eubank et al. HI 141 . where they model contact networks using bipartite 
graphs. Their model describes people's mobility and locations, whereby two per- 
sons in the same location are likely to be infected. Eubank et al. then use the 
resulting model to analyze the spread of diseases and also to develop methods for 
surveillance and vaccination. This work shows that location can be used to cluster 
nodes that are likely to pass a given geographical area. 

4.2 Theoretical and Trace Studies 

An important observation is that the works we have reviewed thus far, most if 
not all, use the random way point mobility model. Table [9] lists the experiment 
parameters of all protocols. There is thus a need to experiment with more realistic 
models. This issue is particular critical given that DTNs are usually based on 
the movements of both inanimate and animate objects; e.g., vehicles and animals. 
Apart from that, we can see all experiments contain a small number of nodes with 
limited data rate in large areas for simulating sparse nodes density. Additionally, 
nodes are also allocated limited transmission range and buffer size. Given that all 
existing works use varying simulation methodologies, it is therefore very difficult 
to determine the "best" performing routing protocol for a given DTN. Therefore, 
there is a critical need for a unified research methodology that compares all routing 
protocols comprehensively. 

To date, only a handful of works have proposed alternative mobility models. 
Bai et al. H1151 introduce several mobility models; namely, group, freeway and 
Manhattan model. The group model is also called the Reference Point Group 
Mobility Model (RPGM) H1161 . In RPGM, nodes move together as a group and 
every group has a central node called group leader. The group leader determines 
the movement of group members. In the freeway model, nodes move according 
to predetermined routes. Lastly, the Manhattan model simulates nodes movement 
in a metropolitan scenario. Other than that, Leung et al. H1171 describe a high- 
way model where nodes enter and leave a highway through multiple entrances 
and exits. Lastly, the Homing-Pigeon-Based (HoP) model H1181 models a sce- 
nario where each community has a designated message deliverer that periodically 
carries bundles from their home community to various destinations before return- 
ing home. These works, however, have not comprehensively compared routing 
protocols designed for DTNs, and thus is an important future work. 

Recently, researchers have begun using trace based simulation studies. That 
is, instead of using theoretical models such as Random Way Point, they use traces 
of node movements at a given location; e.g., at a conference or city. It is important 
to note that trace files are specific to a given environment, and cannot be readily 
generalized to other scenarios. Moreover, they have limited number of nodes; 
i.e., they cannot be scaled readily to thousands of nodes. Their advantages, how- 
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ever, include the availability of contextual information, and group or community 
membership. Also, based on nodes' pre-recorded movements, one can easily de- 
termine nodes with high centrality, and the optimal forwarding path. Indeed, this 
is the key observation that motivated Hui et al. 11951 to develop a forwarding algo- 
rithm, called BUBBLE. They identified via trace-files analysis that people based 
DTNs are characeterizedby popular individuals or groups. In effect, individuals 
and communities have a ranking that denotes their 'popularity' in a given DTN. 
To this end, their algorithm fowards or "bubbles' bundles to increasingly popular 
nodes or communities. 

Past works usually use traces that capture a node's location, encounter times 
and durations. For instance, in H119II , researchers traced the positions of 500 taxis 
in San Francisco over 30 days using GPS and roadside servers with wireless trans- 
mitter. In a different work, the authors of HI 2011 collected the ID of Bluetooth 
devices carried by students on a university campus. By far, the following are 
the most popular trace files: (i) Dartmouth/campus H121L (ii) Haggle H122L (iii) 
MIT/reality lfT20l . (iv) National University of Singapore (NUS) lfT23l . and (v) 
UMass/diesel iPTOll . However, thus far, there has been a lack of work that com- 
pare the performance of DTN routing protocols using a variety of traces. Lastly, 
a critical issue is that no works have used a mixed of theoretical and trace based 
simulation studies to evaluate proposed protocols. 

4.3 Multicast 

To date, there are only a few multicast routing protocols. Hence, there are am- 
ple opportunities to design new ones that incorporate the various strategies used 
by epidemic, DFs and proactive based routing protocols. In particular, only a 
few works have studied the efficacy of epidemic routing protocols in delivering 
bundles to multicast group members. Thus far, in H102L the authors have only 
investigated epidemic with TTL. The authors, however, have not considered mul- 
ticast sessions with a large number of subscribers, where a small TTL value may 
lead to bundles expiring prematurely. This means epidemic with EC or epidemic 
with immunity are better suited for multicasting in DTNs. However, currently, no 
work has conducted any investigation on these two buffer policies. 

Second, no researchers have studied the effect of anti-entropy in multicast 
scenarios. In ETI . the authors pointed out that 'comparing before exchanging' 
is critical because it avoids duplicated bundles. This is important because once 
a node receives a bundle, it will never receive the bundle again even though said 
bundle has been discarded from its buffer. However, in multicast, given that there 
may be more than one subscriber, it is unclear whether anti-entropy will result 
in a lower bundle delivery ratio. Intuitively, a node that has delivered a bundle, 
and subsequently deleted the bundle before meeting another subscriber should be 
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Table 8: Analogous concepts in epidemiology and the challenged networks 



Concept 


Epidemiology 


DTNs 


Information Carrier 


Viruses 


Nodes 


Performance Metrics 


Spreading rate 


Number of bundles delivered 




Number of people infected 


Number of devices with bundles 


Application Constraints 


Virus lifecycles 


Bundle delivery deadline 


Host Constraints 


Opportunity for infection and contagiousness 


Node's transmission range and data rate. 




A person's medical profile; e.g., age, gender. 


Node's storage capacity 




An individual's susceptibility to infection 


Node already has stored bundle 


Congestion Control 


Vaccination methods 


Removing bundles from nodes 


Topology 


Population's spatial distribution 


Node's spatio-temporal patterns 



Table 9: A summary of experiment parameters 



Protocols 


Epidemic routing 


DFs 


Proactive 


Number of Nodes 


<100 


< 120 


< 240 


Mobility pattern 


Random waypoint 


Random waypoint, 


Random waypoint 






along streets 




Area 


< 50km 2 


< 100km 2 


<25km 2 


Data rate 


<10KBps 


<250KBps 


Transmission 


<300m 






Delivery rate, 






Delivery rate, 


average delay, 




Evaluation 


average delay, time 


KB/J, number of 


Delivery rate, 


metrics 


to deliver all 


times DFs move 


average delay 




bundle 


from sources to 








destination nodes 




Buffer size 


Infinite or up to 5MB 


Bundle size 


<14MB 



given another copy of said bundle to improve bundle delivery ratio. 

Third, no research has analyzed the impact of having multicast subscribers act 
as relay nodes to forward bundles. In every epidemic variant, e.g., H102H . 0411 . 
[|25l . subscribers are effectively sink nodes, and are not required to forward bun- 
dles to others. However, in multicast, it is unclear how this influences multicast 
delivery. This is an important consideration when there are more subscribers than 
non-subscribers or relay nodes. If there are only a few relay nodes, the probability 
of a subscriber meeting a relay node becomes less, which increases end-to-end 
delays. In the worst case, only the source is the relay and all other nodes are 
subscribers. This means the source will have to meet each subscriber in order to 
deliver bundles. 

Fourth, the size of a multicast group affects the delivery ratio of epidemic 
with TTL H102H . Their experiments show that when the multicast group size in- 
creases, the delivery ratio drops accordingly. In particular, when the multicast 
group size increases from two to 60, delivery ratio reduces from 95% to 50%. 
However, it is unclear whether multicast group size has any impact on delivery 
ratio if subscribers do not forward bundles, or when nodes employ EC or immu- 
nity packets to discard bundles. This is because the ratio of multicast subscribers 
to non-subscribers is a critical issue as larger multicast group sizes mean fewer 
relay nodes, and vice-versa. Moreover, in epidemic with immunity, the number 
of immunity packets corresponds to the multicast group size. Increasing the mul- 
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ticast group size translates to proportionally more immunity packets. This, how- 
ever, corresponds to faster bundle deletion despite some subscribers not having 
received the deleted bundles. 

5 Conclusion 

The concept of DTNs has matured significantly in the last decade, with a myriad 
of novel applications that range from tracking wildlife to exploiting the movement 
of humans. A notable application is providing connectivity to rural areas that oth- 
erwise would have been isolated from the Internet. As a result, like the Internet, it 
is almost impossible to predict what economical, social and technological impact 
DTNs will have in the future. 

A key underpinning that will ensure the success of DTNs is the performance 
of routing protocols. In this respect, this paper has reviewed three main categories 
of routing protocols, where we provide a comprehensive qualitative comparison 
of key features within as well as between categories. Furthermore, we presented 
the handful of works that aim to design efficient multicast routing protocols. We 
also highlighted the research methodologies used by existing works and the lack 
of a common set of parameters that can be used to evaluate existing protocols ob- 
jectively. Finally, we review an emerging area that aims to incorporate research 
from social networks and epidemiology in the hope of identifying invariant prop- 
erties that will help enhance the performance of routing protocols. We believe this 
to be the next frontier in DTNs research, especially in light of ubiquitous, power- 
ful mobile devices that are equipped with ample storage, camera and positioning 
capability that are capable of carrying data as well as collecting context and social 
information. 
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